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Chromatin immunoprecipitation coupled with DNA sequencing (ChlP-seq) is the major contemporary 
method for mapping in vivo protein-DNA interactions in the genome. It identifies sites of transcription 
factor, CO factor and RNA polymerase occupancy, as well as the distribution of histone marks. Consortia such 
as the ENCyclopedia Of DNA Elements (ENCODE) have produced large datasets using manual protocols. 
However, future measurements of hundreds of additional factors in many cell types and physiological states 
call for higher throughput and consistency afforded by automation. Such automation advances, when 
provided by multiuser facilities, could also improve the quality and efficiency of individual small-scale 
projects. The immunoprecipitation process has become rate-limiting, and is a source of substantial 
variability when performed manually. Here we report a fuUy automated robotic ChIP (R-ChIP) pipeline that 
allows up to 96 reactions. A second bottleneck is the dearth of renewable ChlP-validated immune reagents, 
which do not yet exist for most mammalian transcription factors. We used R-ChIP to screen new mouse 
monoclonal antibodies raised against p300, a histone acetylase, well-known as a marker of active enhancers, 
for which ChlP-competent monoclonal reagents have been lacking. We identified, vaUdated for ChlP-seq, 
and made publicly available a monoclonal reagent called ENCITp300-l. 



^ ^ J II ^ ontemporary studies of gene regulation are often based, at least in part, on learning the patterns of 

con ri u e equa y o a chromatin mark distribution and the locations of specific transcription factor occupancy in the genome, 
this v/ork. 'pjjg chromatin immunoprecipitation (ChIP) assay, in several variations, provides this information' 

Chip protocols typically begin by cross-linking proteins to DNA (usually with formaldehyde); then selectively 
retrieving DNA fragments associated with a protein of interest by immunoprecipitation; and finally analyzing the 
enriched DNA. Originally, ChlP-enrichment was analyzed using qPCR at predefined genomic regions'*. Later, it 
was coupled with microarray readouts (ChlP-chip/ChlP-on-chip) which allowed many selected regions to be 
assayed in parallel (e.g. all promoters) or even whole genomes, especially in organisms with small genomes^"'. 
Eventually, high-throughput sequencing enabled truly genome-wide mapping of protein-DNA interactions, with 
high resolution, in the form of ChIP-seq'° 

ChlP-seq has become the workhorse for mapping the whole-genome occupancy and genomic distribution of 
hundreds of transcription factors and numerous histone modifications in a wide variety of human, mouse, and 
worm cell lines and tissues by the ENCODE'^"*, mouse ENCODE" and modENCODE consortia^"'^', and the 
NIH Roadmap Epigenomics Mapping Consortium^''. Despite the large number of datasets generated thus far, 
they are a small fraction of the expected future experiments from individual laboratories as well as consortia. 
Initially, DNA sequencing capacity and cost were major barriers to large scale ChlP-seq, but sequencing capacity 
has increased by several orders of magnitude and costs per ChIP have dropped significantly. The immunopre- 
cipitation step has now emerged as rate-limiting. It is tedious, and in practice it is often variable from one 
practitioner to another, from experiment to experiment, and even among replicates in a single experiment. 
This suggested that a robust robotic ChIP protocol could stabilize and improve data quality, reproducibility, 
manpower use, and overall costs and efficiency per experiment. An automated system would offer these benefits 
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to individual laboratories doing small numbers of experiments, 
through core facilities, in addition to enabling large-scale projects 
and consortia. 

A second independent challenge for contemporary ChlP-seq 
experiments is that the supply of high-quality sustainable immune 
reagents that have been experimentally validated for ChIP remains 
very limited. Many antibodies, including some marketed as "ChlP- 
grade" have failed in the ENCODE pipeline, and many that have 
succeeded are polyclonal, which means that different lots can vary 
radically in how well they perform in ChlP^'. At present, monoclonal 
antibodies are the most reliable renewable ChIP reagents, although 
they do not account for the majority of characterized reagents, and 
there are no ChlP-competent reagents for the majority of human and 
mouse transcription factors. The field therefore faces the twin chal- 
lenges of generating large quantities of ChlP-seq data in reliable 
high-throughput manner for factors with extant affinity reagents, 
and having to screen and characterize new sustainable immune 
reagents. 

In this work we develop a fuUy automated robotic pipeline for the 
chromatin immunoprecipitation reaction (R-ChIP). High-through- 
put 96-well plate methods for performing ChIP have been described 
before^'*'^^. However, those methods require substantial hands-on 
time and are subject to variability inherent in experiments done by 
humans. A conceptually similar robotic approach was recently 
developed independently^*", though it differs from the one presented 
here in requiring manual intervention at several steps. The R-ChIP 
protocol reported here is fully automated and employs a widely used, 
multipurpose programmable liquid handling robotic platform 
(Tecan Freedom EVO 200), which can be used for a multitude of 
other purposes, such as robotic plasmid cloning or automated ELISA 
screenings when it is not being used for ChIP. We test our protocol 
on factors that have previously been characterized in multiple 
ENCODE cell lines and show that it performs comparably to high 
quality manual ChlP-seq in enrichment and in producing ChlP-seq 
libraries that are consistent within and between experiments. We 
then appUed R-ChIP to screen candidate monoclonal antibodies 
directed against the transcriptional co-activator p300, a protein for 
which monoclonal ChlP-competent reagents have until now not 
been available, and for which polyclonal reagent lots have been 
highly variable. 

Results 

Automated ChIP protocol adaptations. The primary goal of this 
work was to fully automate ChIP without compromising yield and 
quality. Our design approach was to develop automation that mimics 
as closely as possible the established manual process, using the 
ENCODE Chip protocol as the starting point (the current manual 
ENCODE Chip protocol is provided in the supplementary Appen- 
dix). Where substantial changes were made to accommodate auto- 
mation, the we benchmarked the new process against results from 
the established protocol. 

We configured a Tecan Freedom EVO 200 robot as detailed 
(Figure 1 and Supplementary Figure 2), and programmed it for run- 
ning automated chromatin immunoprecipitation reactions in a 96- 
well format (the program itself is supplied as a supplementary file). 
Major considerations for automating ChIP revolved around mag- 
netic bead-handling to achieve successful incubation, washing, and 
recovery of immunoprecipitated material, while effectively eliminat- 
ing unbound chromatin. In the manual version of ChIP, bead agita- 
tion is achieved by tumbling the reaction mix in standard Eppendorf 
1.5-mL micro tubes on a tumbler wheel. The agitation device avail- 
able on the robot is an orbital shaker with a 2-mm shake radius and 
adjustable speeds ranging from 100 rpm to 1600 rpm. An alternate 
method for automated bead agitation mixes by repeated pipetting 
(trituration). We reasoned that pipetting would lead to inevitable 
bead losses to the pipette surface, especially as multiple tip changes 



would be required. We therefore focused on the orbital shaker. The 
second automation constraint comes from the 96-well plate format 
compared with individual microtubes in the manual protocol. This 
change requires effective robotic washing without cross-contamina- 
tion between wells or sample spillage. 

Finally, the 96-well format requires a plate magnet strong enough 
to efficiently pull down all beads. Several vendors offer plate magnets 
compatible with the robot platform, but most are designed for stand- 
ard low profile micro plates. In contrast, our automated ChIP pro- 
tocol requires deep-well plates for effective bead washing. A magnet 
designed specifically for deep-well plates (SPRIPlate Super Magnet 
Plate from Agencourt, Beckman Coulter) proved effective. Its success 
in our hands was optimal with a round-well-deep well plate with U- 
bottom wells (catalog # 278752, Nunc). A summary of major differ- 
ences for the robotic protocol is below and both protocols are given in 
detail in the Supplementary Methods: 

1. Bead agitation was changed from a tumbling motion in the 
manual protocol to rapid orbital shaking. The shake speed 
was optimized to keep beads fully suspended without spillage 
(1400 rpm). 

2. The sample volume was reduced from 1000 jiL to 500 /(L to 
prevent spillage. 

3. Wash steps after antibody and chromatin binding were 
increased in number from 3 to 4 to compensate for the smaller 
wash volume. 

4. Bead recovery time on the magnet was extended to 7 min on 
the robot, a condition determined empirically using the cri- 
terion that no detectable beads were left behind in the super- 
natant upon microscopic inspection. 

5. The step that dissociates bound chromatin from antibody-mag- 
netic beads is done in the robot's PCR module, thus eliminating 
constant bead agitation in the manual protocol during the 65°C 
1 hour incubation. The beads are now suspended by the orbital 
shaker after the 1 hour incubation. 

Consistency of robotic ChIP results. We first tested the robustness 
and reproducibility of our robotic ChIP protocol by carrying out 
multiple manual and R-ChIP experiments for the NRSF/REST 
transcription factor. NRSF/REST^'""* is a negative transcriptional 
regulator of neuronal genes in non-neuronal cell types. It was the 
first transcription factor to which ChlP-seq was applied", its binding 
has been extensively mapped in multiple cell lines, and its recogni- 
tion site (and its binding variants) is well studied. The monoclonal 
antibody used for NRSF ChIP has been well characterized in the 
ChlP-seq format. It is thus a good system to characterize the method. 

We performed ChlP-seq experiments in two cell lines, GM12878 
and Jurkat, producing at least three libraries from four separate plates 
for GM12878 and from four separate plates for Jurkat. We compared 
the resulting data to existing manually generated NRSF ChlP-seq 
datasets for GM18278 cells"" and to four additional manual ChlP- 
seq datasets generated in parallel with the R-ChIP ones. These data 
are summarized in Figure 2. 

To assess ChIP quality, we used library and ChIP QC metrics that 
were developed previously by us and others as part of the ENCODE 
Consortium^'-'". The first question regarding ChIP quality is how 
well the immunoprecipitation step has enriched for DNA fragments 
attached to the antigen of interest. This can be assessed by calculating 
the fraction of reads falling within called peaks (FRiP^') or by using 
cross-correlation''"'". Both measures have limitations in some special 
cases'"", but when both are applied and concur, confidence in the 
results is high. Figure 2A shows the number of called peaks and 
Figures 2B and 2C show the RSC (Relative Strand Correlation^') 
and FRiP values for manual and robotic NRSF ChlP-seq datasets. 
R-ChIP data consistently exhibited good RSC values (RSC s 1) and 
FRiP and peak number values comparable to those of manually 
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Figure 1 | Illustration of individual automated ChIP protocol steps. A Tecan Freedom EVO 200 robot equipped with a Liquid Handling arm (LiHa), a 
Multi Channel Arm (MCA) and Robotic Manipulator arm (RoMa) is used for all steps. Additional devices integrated into the robot are standard- 
size plate carriers, magnet plate, orbital plate shaker and PCR machine. The cartoons in the left column illustrate each protocol step, described in the flow 
diagram in the second column. The cartoon sequences on the right illustrate the robotic process step sequences used for each protocol step. The white 
arrows pointing to the protocol steps indicate which robot sequences apply to each protocol step. 



generated libraries, with the exception of three Jurkat libraries (the 
first Chip on plates 2, 3, and 4, Figure 2A, 2B and 2C) that scored as 
less successful. We do not presently know the cause of these lower- 
quality libraries, but their frequency is well within the range of vari- 
ability of manually generated libraries we have observed over several 
years, during which sporadic unsuccessful experiments for factor/ 
antibody pairs that are otherwise routinely successful have occurred. 
Finally, we asked how similar the final sets of called peaks are for the 



robotic protocol and how they compare with reference manual data- 
sets for the same factor and cell type, by evaluating peaks called after 
sequencing. Figures 2D and 2E show the similarity of peak call sets 
for all libraries measured by calculating the size of the overlap 
between each pair of libraries. Overall, we observed consistently high 
overlap scores and thus high reproducibility between libraries. These 
observations applied both within and between plates, underscoring 
the consistency and robustness of the R-ChIP protocol. 
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Figure 2 | Reproducibility of R-ChIP experiments. Multiple ChlP-seq experiments on multiple plates were generated for the NRSF/REST repressor in 
GM12878 lymphoblastoid cells (n = 4 plates) and Jurkat T-cells ((7 = 4 plates) cell lines. The numbers (1 through 5) refer to the number of the plate a 
Ubrary came from, "M" refers to manually generated datasets. The first two manual GM 12878 datasets were previously published as part of the ENCODE 
project, the next four were generated in parallel with the R-ChIP ones. (A) Number of called regions for each dataset (using ERANGE 4.0") (B) 
Assessment of ChIP enrichment using RSC (Relative Strand Correlation) cross-correlation scores^'; (C) Assessment of ChIP enrichment using FRiP 
(Fraction of Reads in Peaks) scores (Landt et al. 2012); (D) Overlap between called peaks in robotic and manual ChIP libraries in GM12878 cells; (E) 
Overlap between called peaks in robotic and manual ChIP libraries in Jurkat cells. The overlap score ( Oxy) shown in each box indicates the fraction of 
peaks in the dataset on the y-axis that are also found in the dataset on the x-axis, i.e. Oxy = \X H Y\/\Y\. 
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Figure 3 | Comparison between manual CHIP and R-ChIP results for additional targets. Manual and robotic ChlP-seq datasets were generated in 
parallel using the same chromatin samples and the result compared. (A,B,C) ChlP-seq against H3K27ac in GM12878 cells. (A) FRiP score, (B) number of 
peaks called, (C) overlap between the sets of peaks; (D,E,F) ChlP-seq against GABP in GM12878 cells. (D) FRiP score, (E) number of peaks called, (F) 
overlap between the sets of peaks; (G,H,I) ChlP-seq against ZBTB33 in GM12878 cells. (G) FRiP score, (H) number of peaks called, (I) overlap between 
the sets of peaks; (J), (K), (L) ChlP-seq against PU. 1 in GM12878 cells. (J) FRiP score, (K) number of peaks called, (L) overlap between the sets of peaks. 
The overlap score (Oxy) shown in each box indicates the fraction of peaks in the dataset on the y-axis that are also found in the dataset on the x-axis, i.e. 
OxY = IX n 71/171. 



To further characterize the consistency between the results from 
R-ChIP and manual ChIP experiments, we generated paired manual 
and robotic ChlP-seq datasets using matched chromatin samples for 
several additional targets (Figure 3). These included the H3K27ac 
histone modification (Figure 3A, 3B and 3C), the GABP transcrip- 
tion factor'^ (Figure 3D, 3E and 3F), the ZBTB33/Kaiso zing-finger 
protein known for its preferential binding to methylated DNA'''' 



(Figure 3G, 3H and 31), and the important regulator of hematopoi- 
esis SPIl/PU.l"^''^ (Figure 3J, 3K and 3L). We observed comparable 
results between the manual and robotic datasets, further confirming 
the applicability of R-ChIP to large-scale ChlP-seq production. 

Using R-ChIP to characterize new monoclonal p300 antibodies. 

Having established the R-ChIP protocol, we next applied it to 
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characterize a set of monoclonal antibodies raised against the p300 
transcriptional co-activator in the Beckman Institute Protein 
Expression Center and Monoclonal Antibody Facility at Caltech. 
The p300 protein is a histone acetyltransferase, best known for its 
role in the acetylation of histones^*"'". It is used as a marker of active 
transcriptional enhancers in mammalian genomes''^"*^. Commer- 
cially available antibodies used to generate published p300 data are 
from a series of polyclonal nonrenewable reagents that have a 
reputation for lot-to-lot variability. 

We generated 11 0!-p300 mouse monoclonal antibodies which 
were initially screened, cloned and then rescreened using a plate- 
based ELISA assay. We tested hundreds of individual hybridoma 
B-cells isolated after fusion with cells from spleens of mice injected 
with a GST-tagged p300 protein fragment (aa 152-213) or a syn- 
thetic KLH-coupled peptide (aa 1526-1545). The GST-tagged pre- 
parations were subjected to formaldehyde fixation (1% FA for 
10 min) before immunization with the goal of increasing the like- 
lihood of reactivity in ChlP. The resulting 1 1 p300 monoclonal anti- 
bodies were tested for ChIP together with two lots of rabbit 
polyclonal p300 antibodies (Santa Cruz sc-585, lot numbers F2711 
and E3113) on chromatin from GM12878 cells. The resulting data- 
sets were compared to each other and to publicly available ENCODE 
p300 data from the same cells (using two commercially available 
rabbit polyclonal antibodies, Santa Cruz sc-585 and sc-584) 
(Figure 4). Three of the mouse monoclonal antibodies raised against 
the synthetic KLH-coupled p300 peptide scored positive by ChlP- 
seq, identifying between 1,524 and 4,870 peaks (Figure 4A and 4B). 
We sequenced multiple additional replicates for the best-scoring one, 
1F4-E10P and identified an even higher number of peaks in some of 
the datasets, up to 8,430, with the typical number being —6,000. The 
peaks called in the monoclonal antibody dataset are a subset of those 
found in the polyclonal data (Figure 4C) confirming the specificity of 
the antibodies towards p300. While the monoclonal numbers are 
lower than the two most successful polyclonal datasets, they are 
within the range of what was previously observed in ENCODE data, 
and also within the range of published p300 datasets. We have 
renamed p300 1F4-E10P as ENCITp300-l and submitted it under 
this name to the Developmental Studies Hybridoma Bank (http:// 
dshb.biology.uiowa.edu/). 

It was not our purpose in this study to characterize new polyclonal 
reagent lots, but the p300 reagents used previously by ENCODE were 
no longer available. We therefore used two additional rabbit poly- 
clonal antibodies in R-ChIP (Santa Cruz sc-585, lot numbers E3113 
and F2711), and they identified up to —30,000 peaks. This number 
greatly exceeds previously published p300 datasets, including cur- 
rently available ENCODE data for the same GM12878 B-cell line (for 
which between 2,610 and 12,924 peaks were called previously) 
(Figure 4A and 4B). This increase has two likely causes, and they 
are not mutually exclusive. The first well-appreciated variable is 
different performance by polyclonal antibody lots. In principle, indi- 
vidual lots can vary in the number and identity of epitopes recog- 
nized, in effective antibody concentration, and in non-specific 
reactivity. A second difference from the prior ENCODE data is the 
fixation condition. For p300, we fixed cells at 37°C for 30 minutes, 
versus room temperature for 10 minutes for the historic ENCODE 
data. This condition was suggested to us by Dr. Bing Ren, UC San 
Diego, and is based on the idea that a longer time and elevated 
temperature might increase p300 cross-linking via indirect links to 
DNA-bound transcription factors or histones. This condition sig- 
nificantly improves p300 ChIP in our hands (for example, we gen- 
erated datasets using the 1F4-E10P antibody on chromatin fixed 
under standard conditions and they were all unsuccessful; 
Supplementary Figure 4). Of the 30,000 p300 peaks called using 
the 37°C fixation condition, the majority (between 76% and 88%) 
overlap with one or more chromatin marks associated with enhan- 
cers and promoters in ENCODE data (H3K27ac; H3K4mel) or with 



regions of DNAse hypersensitivity (Supplementary Figure 3). This is 
consistent with these regions being active enhancers and promoters. 
For multiple cell types, the numbers of DNAse hypersensitive 
regions'" '", H3K27ac and H3K4mel positive regions, reported prev- 
iously are typically in the tens of thousands (ENCODE Project 
Consortium 2012), and the number of expressed genes per cell type 
is between 5 and 10 X 10'. Thus the expected number of enhancers 
(and potentially p300-positive regions) is larger than the single-digit 
thousands of p300 peaks called in most previously available data. 
While reagent-specific background, including possible polyclonal 
cross-reactivity, could explain the modest number of p300 peaks that 
lack additional enhancer or promoter marks, the most parsimonious 
explanation for the overall very large number of newly revealed p300 
peaks is that prior ChIP measurements have underestimated p300 
occupancy. The monoclonal peaks also correspond preferentially to 
the strongest sites called for the polyclonals under the 37°C fixation 
condition (Supplementary Figure 8). Our best-performing monoclo- 
nal antibody did not produce comparably high peak numbers using 
the same chromatin substrate, but 99% of its peaks overlap those 
called in the polyclonal datasets. We note that the polyclonal anti- 
body lots used here are also no longer in stock from the 
manufacturer. 

We then tested additional factors with the 37°C fixation condition. 
Results were very similar to those with the standard condition for 
NRSF, H3K27ac and GABP (Supplementary Figure 5, Supplemen- 
tary Figure 6A-F), suggesting that the more aggressive fixation con- 
dition does not result in general nonspecific background. However, 
for PU.l and ZBTB33, the 37°C fixation led to fewer peaks and 
overall lower quality ChIP data (Supplementary Figure 6G-L). As 
discussed below, our data suggest that the impact of fixation con- 
dition on Chip outcome might be more complex than is generally 
acknowledged, and that a wider survey of factor-antibody pairs rela- 
tive to fixation conditions will be needed. 

Discussion 

The robotic ChIP (R-ChIP) reported here was developed on a widely 
used commercial liquid-handling platform whose configuration and 
running program for ChIP are provided. It was developed to increase 
ChlP-seq throughput, uniformity, and quality, while reducing 
investigator tedium and error. It should be equally applicable for 
large-scale projects or for smaller, individual-lab projects carried 
out in multiuser core facilities. 

Our R-ChIP results were comparable in quality to those from a 
large-scale manual pipeline by multiple metrics. However the plat- 
form's performance is not perfect. We expect that we and others will 
continue to make improvements. Specifically, we have observed 
sporadic single reaction failures among replicate samples within an 
individual plate, possibly associated with edge and corner well posi- 
tions (Supplementary Figure 7), but sporadic failures are also 
observed with manual experiments. It is our standard practice to 
include on each R-ChIP plate a minimum of triplicate control sam- 
ples deployed across the plate geometry. For these controls, we use a 
monoclonal reagent and a large batch of control chromatin that 
permits comparisons between plates through time. This allows us 
to evaluate each plate run and to compare it with other runs. This 
evaluation can be done as a QC step before committing to building 
and sequencing all remaining libraries. This gating step has clear 
economic implications. 

Chip troubleshooting is aided by R-ChlP. If a group of failed 
samples are embedded in a large R-ChIP run where the controls 
and other samples are successful, it becomes unlikely that the ChIP 
process is the source of failure, and a user can turn attention to the 
input sample and immune reagent (or any post-ChIP variation) as a 
more likely problem. Of course, the overall success of a ChlP-seq 
measurement includes the local DNA sequencing process,with spe- 



SCIENTIFIC REPORTS | 4:5152 | DOI: 1 0. 1 038/srep051 52 



6 



SC585SYDH Rep2 
SC585 SYDH Rep1 
SC584 SYDH Rep2 
SC584 SYDH Rep1 
SC585 F2711 
SC585 F2711 
SCS8S E3113 
SCS85 E3113 
SC585 E3113 
1F4_E10P 
1F4_E10P 
1F4_E10P 
1F4_E10P 
1F4_E10P 
1F4_E10P 
7H5_F2P 
5F7_C9P 
5D2_A1P 
4C5_A1 
3H6_B6 
3B4_G6P 
2F6_F7 
2F4_A8 
2E10_D10P 
1A3 F8P 



Number called peaks 



B 



I — I 



□ 
□ 



Chip enrichment (FRiP) 



□ 

□ 
□ 
□ 
□ 



sc585 Rep2 SYDH 
sc5S5 E3II3I 
5C585 E3II3I 
SC585 E3113 
sc585 F271l| 
SC585 F2711 



6374 t33i 6725 S430 61B1 4S7D 2868 1524 3110 U9Z4 451D 8267 2610 




ss 






1SDS3 


1.00 


0.74 


0 70 


081 


0.70 


0 62 


0 42 


□ 23 






0.77 


0.45 


0.71 


0 24 




0.99 






0.94 


0 70 


100 


0 76 


oai 


0,69 


063 


a-4i 


DJ3 


Q 




0.79 


0 45 


D.7J 


0 24 


0 99 


0,99 


0.94 


0.99 


□ 97 


0.64 


0.74 


1.00 


0,74 


OSl 


0.57 


0.38 


0.11 




30 


o.eo 


0.4S 


0.73 


023 


0 99 


0.99 


0 97 


0.99 


0 99 


0 57 


0 60 


0 57 


100 


OSS 


0.50 


0.31 


Q.17 






Q.67 


0 37 


D£9 


019 


0 99 


0 97 


0 7S 


0 98 


□ SB 


0 6S 


0 69 


0 64 


0 89 


100 


060 


0 39 


nj7 






0 72 


0 43 


0 66 


0 23 


0 99 


0 98 


084 


0 99 


□ 91 


0 79 


0 84 


0 79 


0 90 


OSD 


100 


0 53 


□ 30 






0.83 


0 53 


0 79 


0 28 


0 99 


0 99 


0 94 


0 99 


D9S 


0 88 


□ 93 


0 89 


0 91 


as7 


0 89 


LOO 


□ IB 






oso 


0 64 


□ E9 


0 36 








o.„ 




0 93 


036 


0 95 


0 97 


0 93 


0 95 


0 86 


100 






0 94 


0 75 


0 94 


045 


0 99 


0 99 


0 99 


0 99 


100 


013 


0 45 


042 


0 51 


0 16 


0 41 


0 32 


□ 22 






□ 55 


0 39 


0 50 


0 30 




0.65 


as, 


17 




03t, 












a. 20 








1X)0 








0.86 


0 90 


0 63 


0.90 


0.71 






0.66 






























0.97 




0.52 


0,56 


0.59 


O.Gl 


0.51 


0.46 


0.30 


D.17 


0 


19 


0^4 


0.47 


1.00 


0.24 


0.96 


0.9E 


O.SO 


D.96 


□.B7 






059 






0-51 






036 


C£4 






i.oa 


0.91 






0.91 












































































0.4S 













































































































Fraction overlap 



1.00 0.90 0.80 0.70 0.60 0.50 0.40 0.30 0.20 0.10 0.00 



10,000 20,000 

Number peaks 



30,000 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 

FRIP score 



Scale 
Chr5: 



131995000| 




132005000| 132010000| 
ENCODE SYDH GM12878 p300 SC584 Repl 

ENCODE SYDH GM12878 p300 sc585 Repl 



GM12878 p300 sc585 
GM 12878 p300 1F4-E10P Repl 
6M12878 p300 1F4-E10P Rep2 
GIVI12'878 p300 5F7"-C9P 



IL13I I I 



CIV112878 p300 7H5-F2P 

RefSeq Genes 



>):>))>>)))>))>>) ■ >>)):))>>) i ) ^ 



Figure 4 | Characterization of novel monoclonal p300 antibodies using robotic CHIP. ChlP-seq against p300 was carried out in GM 12878 cells and prior 
ENCODE data for it in that cell line (from the "SYDH" production group) was used as a reference. ENCODE data was generated using two 
different rabbit polyclonal antibodies from Santa Cruz (sc-584 and sc-585). We carried out robotic ChIP testing of two different lots of the sc-585 
antibody and 11 different monoclonals we raised. The 1F4-E10P clone scored best and additional replicates were generated in subsequent experiments. 
(A) Number of called regions; (B) ChIP enrichment as measured by FRiP scores (Landt et al. 2012); (C) Overlap between called peaks with different 
antibodies. The overlap score ( Oxy) shown in each box indicates the fraction of peaks in the dataset on the y-axis that are also found in the dataset on the x- 
axis, i.e. Oxy— \Xn Yi/\Y\; (D) Representative browser snapshot of p300 ChIP enrichment in polyclonal and monoclonal datasets around the IL13 and 
1L4 locus. 



cific sequencing protocols being differentially sensitive to the mass, 
fragment size, and other characteristics of a ChIP sample. 

We used R-ChIP to screen for monoclonal ChlP-competent anti- 
bodies for the histone acetyltransferase, p300. Many antibodies made 
against transcription factors fail in ChIP reactions, even though they 
work well in one or more conventional uses (e.g. standard immuno- 
precipitation, western blots or immunocytochemical stains). 
Moreover, polyclonal reagents that are ChlP-seq compatible typ- 
ically vary, sometimes greatly, in their specificity and performance 
from lot to lot^^. The upshot has been that the only way to identify a 
ChlP-quality antibody is to test it directly for ChIP, and the most 
general way to ensure reliability and unlimited supply is with a 



monoclonal. Whether the final readout for ChIP competence is 
DNA sequencing or qPCR (the latter requiring known targets for 
the factor), the capacity to test many ChIP reactions is critical for 
screening. The ChIP peaks identified by our top monoclonal, 
ENCITp300-l (originally called 1F4-E10P during screening) overlap 
highly with prior measurements from ENCODE for the same cell line 
and with concurrent polyclonal determinations, confirming its spe- 
cificity for p300 and the utility of R-ChIP for screening new immune 
reagents for ChIP. This reagent is being made publicly available at the 
Developmental Studies Hybridoma Bank. We note, however, that the 
data obtained with it are not as inclusive as the best datasets produced 
using polyclonal rabbit reagents. 
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For p300 R-ChIP, we used chromatin from cells fixed with a 
modified condition {37'C, 1% formaldehyde, 30 min; personal com- 
munication, Bing Ren, UC San Diego). This improved p300 ChIP 
significantly in our hands compared with our standard fixation con- 
dition (see Results and Methods), while it had no detectable effect on 
NRSF, GABP and H3K27ac ChlP. We also observed that the PU.l 
and ZBTB33 ChlP-seq datasets we generated were not as good with 
37°C chromatin as with the standard condition. Until more factors 
are tested directly under both fixation conditions, we cannot predict 
which antibody-factor pairs will be affected or how. Epitope destruc- 
tion or occlusion, or elevated signals from lower affinity interactions, 
are among the plausible negative effects. The most positive impact is 
expected for proteins that interact indirectly with DNA, as p300 is 
understood to do. 

We anticipate that R-ChIP, and variations on it, will advance a 
wide range of functional genomics studies by relieving the emerging 
data production bottleneck, increasing efficiency and improving 
overall data quality. 

Methods 

Cell growth and harvesting. Cells were grown and harvested following established 
ENCODE protocols (available at http://genome.ucsc.edu/ENCODE/cellTypes.htnil) 
with the exception of GM12878 p300 experiments for which chromatin was fixed at 
37 C for 30 minutes. 

Chromatin preparation. Chromatin was cross-linked by adding formaldehyde 
directly to the cell culture media at a final concentration of 1 % and gently mixed for 1 0 
minutes. The exception was (where indicated) fixation at 37 "C for 30 minutes, which 
was used for p300 experiments. In all cases, the formaldehyde reaction was quenched 
by adding glycine to a final concentration of 0.125 M for 10 minutes. Cells were then 
pelleted, rinsed once in cold phosphate-buffered saline (PBS) with 1 mM PMSF and 
once in cold MC lysis buffer (10 mM Tris pH 7.5, 10 mM NaCl, 3 mM MgCl2, 0.5% 
NP-40, and Roche Complete Protease Inhibitor Cocktail) to obtain nuclear pellets. 
Nuclei were sonicated in RIPA buffer (PBS, 1% NP-40 Substitute, 0.5% Sodium 
Deoxycholate, 0.1% SDS, and Roche Complete Protease Inhibitor Cocktail) at a 
concentration of at least 5 X 10^ nuclei/mL using a probe sonic dismembrator from 
Fisher Scientific (Model 550). To check for fragment size distribution after sonication, 
a small fraction of the sample was reverse cross-linked for two hours at 65 C, purified 
using DNA purification columns from Qiagen, then loaded onto a 2% agarose EtBr E- 
Gel from Invitrogen (Supplementary Figure 9). 

Antibodies used. The following antibodies were used: an a-NRSF mouse monoclonal 
(12C11) from the Anderson Lab at Caltech^'•■'^ an c(-p300 rabbit polyclonal (sc-585) 
from Santa Cruz Biotechnology, a mouse monoclonal a-GABP (sc-28312) from Santa 
Cruz Biotechnology, a mouse monoclonal a-Kaiso/ZBTB33 (sc-23871) from Santa 
Cruz Biotechnology, a rabbit polyclonal a-SPIl/PU.l (sc-22805) from Santa Cruz 
Biotechnology, and a mouse monoclonal a-H3K27ac (306-34849) from Wako. In 
addition 11 c(-p300 mouse monoclonals were generated in the Caltech Mouse 
Monoclonal Facility. Four of the c;-p300 mouse monoclonals were raised against a 
bacteriaUy expressed GST fusion protein containing N-terminal residues 152-213. 
The remaining seven antibodies were raised against a synthetic peptide from 
GenScript containing residues 1526-1545. 

Robotic-ChIP (R-ChIP) workflow. ChIP experiments were adapted from 
methods previously described and optimized for performance in a 96-well plate 
format using a Tecan Freedom EVO 200 liquid handling robot. Reagents and labware 
are placed on deck of the robot (Supplementary Figure 1). 

After setup the R-ChIP workflow is completely hands-off and consists of a series of 
modules with a run time of —24 hours, including the 12-hour reverse cross -linking step. 
All aspects of the setup are checked thoroughly to ensure a smooth run. 

1. Blocking and washing of magnetic beads. The Tecan begins by resuspending 
magnetic beads (Invitrogen M-280 Dynabeads) from the source tube with the 
liquid -handling arm (LiHa) and dispenses the magnetic beads into a Fisher 96- 
Well Deep Well™ Polypropylene known as the ChIP plate. 1 00 f.iL of beads is 
used for a monoclonal IP antibody and 200 f.iL for a polyclonal. The LiHa tips 
are evacuated and rinsed with ddH20 between subsequent dispenses to pre- 
vent cross-contamination. 500 /(L of PBS containing 5% bovine serum albu- 
min (BSA) is then dispensed by the LiHa from a buffer reservoir (Te-Fill) to 
block and wash the magnetic beads. The plate containing the beads is trans- 
ferred to an orbital mixer (Te-Shake) with the robotic manipulator arm 
(RoMa) and mixed several times for 20 seconds with a 20 second pause 
between each mix. The RoMa moves the bead plate to a magnetic plate for 
seven minutes where the beads are then pelleted in a ring allowing the multi- 
channel arm (MCA96) fitted with natural 200 fiL tips from TipOne to aspirate 
liquid. These steps are repeated three more times and include an ethanol rinse 
of the MCA96 tips as needed to prevent cross-contamination. 



2. Binding of antibody to magnetic beads. The LiHa adds 400 ^(L of PBS-BSA to 
the antibody plate bringing the final volume to 500 ^(L. The antibody is then 
added to the beads using the MCA 96 which transfers the diluted antibody 
from the a 2.0-mL 96-well PlateOne V-bottom plate to the ChIP plate. For 
monoclonal antibodies, 5 /ig of antibody were diluted in 500 f.iL (10 jig for 
polyclonals). The beads and antibody are incubated together for one hour with 
mixing using the Te-Shake. Any unbound antibody is then aspirated with the 
MCA96 and deposited into a fresh 2.0-mL 96-well PlateOne V-bottom plate 
for further analysis if needed. 

3. Incubation of chromatin and antibody-bead complex. The MCA96 transfers 
500 fiL of chromatin containing 2.5 X 10^ cells from the Matrix tube rack to 
the Chip plate. The chromatin and antibody bead complex are then incubated 
together for 2 hours during which the ChIP plate alternates between the Te- 
Shake and a 4^ C cool plate using the RoMa. The chromatin is stored in 1.2 mL 
screw-top Matrix tubes that can be arrayed on the chromatin plate as needed. 
Any unbound chromatin is then aspirated with the MCA96 and deposited into 
a fresh 2.0-mL 96-well PlateOne V-bottom plate for further analysis if needed. 

4. Washing of IP complex. The LiHa dispenses 500 ^iL of LiCl wash buffer 
(100 mM Tris pH 7.5, 500 mM Lithium Chloride, 1% NP-40, 1% sodium 
deoxycholate) from the Te-FiU onto the beads, which are then mixed for 20 
seconds with 20 second pauses between each mix. The beads are then pelleted 
and the wash with LiCl buffer is repeated four more times. The LiHa then adds 
500 fiL of TE buffer (10 mM Tris pH 7.5 and 1 mM EDTA) and resuspends 
the beads with the Te-Shake for 20 seconds. Beads are then pelleted with the 
magnetic plate and any remaining buffer is aspirated and discarded by the 
MCA96. 

5. Elation from beads. The LiHa dispenses 100 /(L of IP elution buffer (1% SDS 
and 0.1 M NaHCO^) from the Te-FiU and the beads are resuspended by mix- 
ing for 20 seconds with the Te-Shake. The MCA96 then aspirates the beads and 
transfers them from the ChIP plate to a Hard-Shell Semi-Skirted PGR Plate 
from Bio-Rad. The RoMa transfers a PGR lid from the storage hotel and places 
it on top of the PGR plate then transfers the lidded PGR plate to a DNA Engine 
Peltier Thermal Cycler with Remote Alpha Dock System from Bio-Rad. The 
top of the thermal cycler closes and places force on the PGR plate lid creating a 
seal. The beads are then heated for one hour at 65^G to disassociate the IP 
complex from the magnetic beads. 

6. Reversal of Cross-Unks. The RoMa takes the PGR plate from the thermal cycler 
and transfers it to the Te-Shake to resuspend the beads. The PGR plate is then 
transferred to the magnetic plate for pelleting of the beads. The MCA96 mounts 
50 /(L Tecan Pure Disposable tips from Tecan, slowly aspirates the supernatant, 
and transfers it to a fresh PGR plate. 10 fiL of proteinase K from Epicentre 
diluted 1 : 5 in proteinase K buffer (50% glycerol, 50 mM Tris-HCl pH 7.5, 
0.1 M NaCl, 0.1 mM EDTA, 1 mM DTT, 10 mM CaGl2, 0.1% Triton® X-100) 
is then added to the supernatants with the LiHa. The RoMa places a lid on the 
fresh PGR plate and transfers both back to the thermal cycler for a 12 hour 
incubation at 65''G to reverse the cross-links. Once the incubation is fmished 
the plate is transferred to the deck with the RoMa and the R-ChlP is complete. 

DNA cleanup. Samples from the R-ChlP ChIP experiments presented here were then 
cleaned up manually using the protocol described by Qiagen in their Qiaquick 
PGR purification kit with the addition that the EB buffer is heated to 55^G prior to 
elution and eluted in a 50 ^(L volume using DNA lo-bind 1.5 mL tubes from 
Eppendorf We anticipate automating this step. 

Library building and sequencing. Library buUding for sequencing on the lUumina 
HiSeq platform was performed conventionally with barcoding to allow multiple ChIP 
libraries to be sequenced in a single flow cell lane, according to the HudsonAlpha 
ENCODE Chip protocol (Supplementary Methods). Standard methods were used for 
end repair and dA addition of DNA fragments recovered from GhlPs or chromatin 
controls. The fragments were then ligated to lUumina Paired-End adaptor sequences 
and PGR-amplified to complete the adaptor sequences and introduce a 7-base DNA 
barcode in the 17 position. The barcodes allowed mixing of multiple samples per 
flowceU lane. Control libraries were prepared from 500 ng of DNA from reverse 
crosslinked sonicated chromatin. ChIP library starting amounts varied, with a 
median of 7.5 ng. Fragment size selection was achieved at the lower threshold with 
solid phase reversible immobilization (SPRI) technology to recover dsDNA greater 
than 100 bp after adaptor ligation (thereby excluding unligated adaptors) and at the 
upper threshold with an extension time of 30 seconds during PGR amplification. This 
size selection method consistently produced final DNA library fragments that ranged 
from 1 00 to 400 bp, as determined by BioAnalysis. Final library amounts varied by 
Chip, with a median of 546 ng. 

Detailed description of the protocol can be found in the Supplementary Methods. 

Libraries were pooled in equimolar amounts and sequenced on the lUumina 
HiSeq2000 or HiSeq2500 with 50 bp single-end reads following the manufacturer's 
recommendations. Raw sequencing reads are available from GEO accession number 
GSE53366. 

Data processing and analysis. Reads were aligned using Bowtie*^, version 0.12.7, with 
the following settings: "-v 2 -t -k 2 -m 1 --best --strata", which allow for two 
mismatches relative to the reference and only retain unique alignments, against the 
hgl9 version of the human genome (assembly downloaded from the UGSG genome 



SCIENTIFIC REPORTS | 4:5152 | DO!: 10.1038/srep05152 



8 



browser) with the Y chromosome retained or removed depending on the sex of the 
cell hne. Peak calling was carried out using ERANGE^\ version 4.0, with the following 
settings: "--minimum 2 --ratio 3 --listPeak --shift learn --revbackground", against 
matching control samples. Library complexity was estimated as described 
previously^^. Cross-correlation analysis was carried out using version 1.10.1 of SPP^^) 
and the following parameters: "-s =^0:2: 400". 

All additional analysis was carried out using custom-written Python scripts. 
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